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TITLE 

BRANCH METRIC COMPUTATION AND ADD -COMPARE- SELECT OPERATION 

IN VITERBI DECODERS 

BACKGROUND OF THE INVENTION 

5 Field of the Invention 

The invention relates to digital communication systems, 
and more particularly to branch metric computation and add- 
compare- select operation in Viterbi decoders. 
Description of the Related Art 

10 In wired and wireless applications, especially wireless 

LAN (WLAN) , finite signal and noise powers lead to a strong 
probability that any bit in any message may be corrupted. 
These corrupted bits incur errors in wireless transmission, 
causing troubles like lower data rates or lost 

15 transmissions. Forward error correction (EEC) techniques 
have been developed to account and correct for corrupted 
bits. The purpose of FEC is to improve the ' capacity of a 
channel by adding some carefully designed redundant 
information to the data being transmitted through the 

20 channel. The process of adding this redundant information 
is known as channel coding. Convolutional coding and block 
coding are two major forms of channel coding. Convolutional 
codes operate on a continuous input stream, one or a few 
bits at a time, which offer an alternative to block codes 

25 for transmission over a noisy channel. 

A convolutional encoder is a Mealy machine, where the 
output is a function of the current machine state and the 
current input. In general, convolutional codes are 
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described by two parameters : the coding rate and the 
constraint length. The coding rate, k/n, is expressed as a 
ratio of the number of bits into the convolutional encoder 
(k) to the number of channel symbols output by the 
5 convolutional encoder in) in a given encoder cycle. The 
constraint length, K, denotes the ^'length" of the 
convolutional encoder, i.e., how many A:-bit stages are 
available to feed the combinational logic that produces the 
output symbols. 

10 In 1967, Andrew J. Viterbi introduced a decoding 

algorithm for convolutional codes which has become known as 
the Viterbi algorithm. The Viterbi algorithm is defined in 
terms of a graph which is called a trellis diagram. A 
trellis diagram is a 2 -dimensional array of nodes that are 

15 connected with arcs, namely branches. Each column in the 

array contains 2^"^ nodes which represent the 2^"^ states of 
the convolutional encoder. In each stage of the encoding 
procedure, the encoder receives a k-hit symbol, outputs an 
n-bit symbol based on its current state and the input 

20 symbol, and enters a new state which is a function of its 
current state and the input symbol- An arc in the trellis 
diagram from row p in column i to row u in column / + 1 
corresponds to the encoder going from state p at instant i 
of the encoding procedure to state u at instant i + l. When 

25 applied to a sequence of symbols received from a memory- less 
channel, in an iterative manner, the Viterbi algorithm finds 
the maximum likelihood path through the trellis diagram and 
thereby outputs the decoded data. 

Viterbi decoding has been the subject of several United 

30 States patents including U.S. Pat. No. 5,878,060, U.S. Pat. 
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No. 5,912,908, and U.S. Pat. No. 6,415,415. An advantage of 
the Viterbi decoding is the fixed decoding time. The 
Viterbi algorithm is well suited to hardware decoder 
implementation, but its computational requirements grow 
5 exponentially as a function of the constraint length. 
Traditionally, performance and silicon area are the two most 
important concerns in VLSI design. In recent years, power 
dissipation has also become an important concern, especially 
in battery-powered applications, such as cellular phones, 

10 laptop computers, and WLAN adapters. Compared with a K=5 
convolutional code for use in GSM systems, the convolutional 
code specified in the IEEE 802.11a/g standard has a 
constraint length of K=7 . As the constraint length 
increases, the decoding process in the receiver becomes 

15 increasingly complicated and consumes more power. Several 
studies have shown that the Viterbi decoder consumes more 
than one- third of the chip area and the power dissipation of 
the baseband modem. Accordingly, there is a need for a 
novel and efficient scheme that can reduce the complexity 

20 and power consumption of the Viterbi decoder. 

SUMM2^RY OF THE INVENTION 

The present invention is generally directed to an 
apparatus for branch metric computation and add- compare - 
select operation in a rate l/n Viterbi decoder with a 
25 constraint length of K. The apparatus of the invention 
comprises a branch metric generator and an add-compare- 
select unit. The branch metric generator receives a data 
symbol including n decision metrics in g-bit representation 
to calculate a plurality of branch metrics each of which is 
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a measure between the currently received data symbol and a 
corresponding branch label. A branch metric difference is 
further pre- calculated by subtracting a first branch metric 
of a transition from state Sp to state Su at instant / from a 
5 second branch metric of another transition from state Sq to 
state Su at instant/. 

The add-compare- select unit receives the branch metric 
difference,, the first and the second branch metrics at 
instant i from the branch metric generator. It calculates a 

10 path metric difference between a path metric of state Sp at 
instant i-l and another path metric of state Sg at instant 
i-l. Two decision bits of states 5„ and S^ at instant / are 
set, respectively, based on the branch metric difference at 
instant / and the path metric difference. The add-compare- 

15 select unit includes a first adding means for calculating a 
new path metric for state Su at instant / and a second adding 
means for calculating another new path metric for state Sy 
at instant /. According to the decision bit of state Su at 
instant /, the first adding means is capable of selectively 

20 adding the path metric of state Sg at instant i-l and the 
second branch metric of the transition from state Sg to 
state Su at instant / or adding the path metric of state Sp 
at instant i-l and the first branch metric of the transition 
from state Sp to state Su at instant /. According to the 

25 decision bit of state Sv at instant i, the second adding 
means is capable of selectively adding the path metric of 
state Sg at instant i-l and the first branch metric of the 
transition from state Sp to state Su at instant / or adding 
the path metric of state Sp at instant i-l and the second 

30 branch metric of the transition from state Sg to state Su at 
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instant i. The add-compare- select unit preferably comprises 
means for selectively outputting one of the new path 
metrics, which is a survivor path metric of a local winner 
state, by predetermining the local winner state between 
5 states Su and Sv at instant / based on the decision bits of 
states Su and Sv at instant and the sign of the path 

metric difference at instant /—I or the sign of the branch 
metric difference at instant /. Note that states Sp and Sg 
at instant i-l and states Su and 5v at instant i are 

10 organized in a butterfly trellis structure. 

According to another aspect of the invention, an add- 
compare- select apparatus for a Viterbi decoder 'comprises a 
subtracter, a A-bit multiplexer, a A-bit unsigned 
comparator, two combinational -logic circuits, and two adding 

15 means. The subtracter calculates a path metric difference 
by subtracting a path metric of state Sg at instant i-l from 
another path metric of state Sp at instant i-l, where the 
path metrics are represented by . a bits of precision, 
respectively. The A-bit multiplexer selectively provides an 

20 output between A least significant bits of a branch metric 
difference at instant / and the negative thereof according 
to a select signal, where the branch metric difference is 
represented by bits of precision and yS=A+l. The -1-bit 
unsigned comparator generates a comparison result by 

25 comparing the magnitude of X least significant bits of the 
a-bit path metric difference and the magnitude of the A-bit 
multiplexer output. 

A first combinational -logic circuit is responsible for 
logically operating S most significant bits of the a-bit 

30 path metric difference and a sign bit of the branch metric 
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difference at instant i to predetermine whether the 
magnitude of the a-bit path metric difference is greater or 
less than that of the branch metric difference, where 
S=a—X. If this predetermination is met, a decision bit is 
5 set for state Su at instant i based on the predeterminations- 
otherwise, the decision bit of state Su at instant / is set 
to be consistent with the comparison result. On the other 
hand, a second combinational -logic circuit is responsible 
for logically operating S most significant bits of the a-bit 

10 path metric difference and the sign bit of the branch metric 
difference at instant / to predetermine whether the 
magnitude of the a-bit path metric difference is greater or 
less than that of the negative of the branch metric 
difference. If this predetermination is met, a decision bit 

15 is set for state at instant / based on the 

predetermination; otherwise, the decision bit of state 5v at 
instant / is set to be consistent with the comparison 
result . 

According to the decision bit of state Su at instant i, 
20 the first adding means can calculate a new path metric for 
state Su at instant / by selectively adding the path metric 
of state Sq at instant i-l and a branch metric of a 
transition from state Sq to state Su at instant i or adding 
the path metric of state Sp at instant /-I and another 
25 branch metric of a second transition from state Sp to state 
Su at instant /, where the branch metrics are represented by 
X bits of precision- According to the decision bit of state 
Sv at instant the second adding means can calculate 

another new path metric for state S^ at instant i by 
30 selectively adding the path metric of state Sq at instant 
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i-l and the branch metric of the second transition from 
state Sp to state S„ at instant / or adding the path metric 
of state Sp at instant i-l and the branch metric of the 
transition from state Sq to state Su at instant i, 
5 Preferably, the branch metric difference is pre-calculated 
by subtracting the branch metric of the second transition 
from state Sp to state Su at instant i from the branch metric 
of the transition from state Sg to state Su at instant i. 

According to yet another aspect of the invention, a 

10 rate l/n Viterbi decoder with a constraint length of K is 
composed of a dummy insertion unit, a branch metric 
generator, multiple add- compare -select units, and a survivor 
memory unit. The dummy insertion unit performs a dummy 
insertion procedure, which is inverse to a bit-stealing 

15 procedure in a transmitter, on a sequence of decision 
metrics in Q-h±t representation according to a puncturing 
pattern; it also outputs a dummy insertion flag to indicate 
a position at which a dummy value is inserted into the 
decision metrics. The branch metric generator receives n 

20 number of the decision metrics including the dummy value to 
group into a data symbol to calculate a plurality of branch 
metrics each of which is a distance between the data symbol 
and a corresponding branch label. When the branch metrics 
are calculated for the data symbol, the inserted dummy value 

25 is ignored in response to the dummy insertion flag. 
Further, a branch metric difference is pre-calculated for a 
pth sub-group of states including states Sp, Sg, Su and Sv by 
subtracting a first branch metric of a transition from state 
Sp to state Su at instant / from a second branch metric of 

30 another transition from state Sg to state Su at instant /. 
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Preferably, states Sp and Sg at instant i-l and states Su and 
Sv at instant / are organized in a butterfly trellis 
structure . 

Among the P add- compare- select units, the pth unit 
5 receives the first branch metric of the transition from 
state Sp to state Su, the second branch metric of the 
transition from state Sg to state Su and the branch metric 
difference for the pth sub-group of states at instant / from 
the branch metric generator. The /7th add-compare-select 

10 unit also calculates a path metric difference between a path 
metric of state Sp at instant i-l and another path metric of 
state Sg at instant i-l. With the pth add-compare-select 
unit, a pair of decision bits are set for states 5„ and Sy at 
instant / based on the branch metric difference at instant i 

15 and the path metric difference. In addition, new path 
metrics are generated for states Su and Sv at instant /, 
respectively. The pth add-compare-select unit can further 
predetermine a local winner state between states Su and Sv at 
instant i based on the decision bits of states Su and Sv at 

20 instant z, and the sign of the branch metric difference at 
instant / or the sign of the path metric difference. One of 
the new path metrics, which is a survivor path metric of the 
local winner state at instant /, is provided as output. 
Thus P add-compare-select units can reduce half the output 

25 number of path metrics . The survivor memory unit receives P 
survivor path metrics of P local winner states and P pairs 
of decision bits at instant / from the P add-compare-select 
units; it is thus responsible for storing survivor path 
sequences and yielding a decoded bit sequence. 
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DESCRIPTION OF THE DRAWINGS 

The present invention will be described by way of 
exemplary embodiments, but not limitations, illustrated in 
the accompanying drawings in which like references denote 
5 similar elements, and in which: 

FIG. 1 is a block diagram of a Viterbi decoder 
according to an embodiment of the invention; 

FIG. 2 is a graph showing a basic unit of the butterfly 
trellis structure ; 
10 FIG. 3 is a block diagram of an add- compare- select unit 

according to an embodiment of the invention; 

FIG. 4A is a logic diagram of a first combinational - 
logic circuit according an embodiment of the invention; 

FIG. 4B is a logic diagram of a second combinational - 
15 logic circuit according an embodiment of the invention; and 

FIG. 5 is a detailed block diagram illustrating a 
survivor memory unit of FIG. 1 according to the invention . 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention will now be described with 
20 reference to a wireless communication system that conforms 
to the IEEE 802.11a/g standard. According to the invention, 
the communication system need not be wireless and the 
conformant 802.11a/g transceiver referred to herein is 
merely an exemplary illustration. In the example of a 
25 conformant 802.11a/g transceiver, the convolutional encoder 
generates two output bits for every input bit using a code 
with a constraint length K=7, Therefore, its Viterbi 
decoder has a coding rate k/n equal to 1/2. The constraint 
length K=7 means that there are 2^"^ = 64 states of the 
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encoder (since the seventh bit is the input bit) . These 
states are designated as state So (decimal 0, binary 000000) 
to state Ses (decimal 63, binary 111111), The IEEE 802 . lla/g 
standard adopts puncturing technique to achieve higher data 
5 rate. Puncturing is a bit-stealing procedure for omitting 
some of encoded bits in the transmitter, thereby reducing 
the number of transmitted bits and increasing the coding 
rate. For example, if a bit sequence of Ai Bi A2 B2 has the 
bit B2 omitted, it becomes Ai Bi A2 p where p denotes the 
10 punctured bit. Instead of transmitting four encoded bits to 
represent two bits of data (coding rate R = 2/4 or 1/2), 
three bits are transmitted now to represent the same two 
bits of data for R = 2/3, A rate R = 3/4 can be generated in 
a similar manner . 

15 Referring to FIG. 1, a rate l/n Viterbi decoder with a 

constraint length of K according to an embodiment of the 
invention is illustrated and identified by the reference 
numeral 100. The Viterbi decoder 100 has four main blocks 
including a dummy insertion unit 110, a branch metric 

20 generator 120, an add-compare-select (ACS) module 130, and a 
survivor memory unit 140. The dummy insertion unit 110 is 
responsible for inserting dummy data into the Viterbi 
decoder 100 on the receive side in place of the previously 
punctured bits. It performs a dummy insertion procedure 

25 inverse to the bit -stealing procedure according to a 
puncturing pattern (i.e., the coding rate). In this regard, 
the dummy insertion unit 110 retrieves a sequence of 
decision metrics from a buffer in the preceding module by 
generating two column addresses, ADDRO and ADDRl, where the 

30 decision metrics are arranged in order of time received and 
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the earliest one is in address 0. The preceding module can 
deliver either hard-decision or soft-decision values to the 
Viterbi decoder 100. If the decision metrics are quantized 
to one-bit precision, the result is called hard-decision 
5 data. If the decision metrics are quantized with more than 
one bit of precision, the result is called soft-decision 
data. In the case of the conformant 802.11a/g receiver, the 
dummy insertion procedure can be classified into three 
operating modes and represented by way of the following 
10 expression: 

Mode 0 (coding rate R=l /2) 

for (count = 0; count-H-; count < Ndbps) 
{ 

ADDRO = count x 2 ; 
ADDRl = count x 2 + 1 ; 

} 

Mode 1 (coding rate R = 2/3) 

for (count = 0; count-^; count < Nobps) 

i 

if ({count mod 2) = = 0) 
{ 

ADDRO = count x 3/2 ; 
ADDRl = count x 3/2 + 1 ; 

} 

if ((count mod 2) == 1) 

{ 

ADDRO = (count - 1) x 3/2 + 2 ; 

/* insert a dummy value, ADDRl: don 't care *l 

} 



Mode 2 (coding rate = 3 / 4) 

for (count = 0; count++\ count < Ndbps) 

{ 

if ((count mod 3) == 0) 

{ 

ADDRO = count x 4/3 ; 
ADDRl = count x 4/3 + 1 ; 

} 

if ((count mod 3) 1) 
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ADDRO = (count - 1) x 4/3 + 2 ; 

/* insert a dummy value, ADDRl i don 7 care */ 

} 

if {{count mod 3) ==2) 

{ 

/* insert a dummy value, ADDRO: don 't care */ 
ADDRl = (count - 2) x 4/3 + 3; 

} 

I . 

Note that a cycle-based counter, count, is built in the dummy 

insertion unit 110 to generate the count from 0 to Nosps^^t 
where Ndbps is the number of data bits per OFDM symbol. 
Furthermore, the dummy insertion unit 110 outputs a dummy 
5 insertion flag, DIF, to indicate a position at which a dummy 
value is inserted into a sequence of decision metrics. 
Using the column addresses ADDRO and ADDRl, Ncbps decision 
metrics are retrieved in Ndbps clock cycles according to the 
puncturing pattern, where Ncbps is the number of coded bits 

10 in an OFDM symbol. After retrieving Ncbps decision metrics, 
the operation of the dummy insertion unit 110 is suspended 
and the built-in counter is reset to zero. 

The branch metric generator 12 0 accepts the dummy 
insertion flag DIF and a sequence Y at its inputs as shown 

15 in FIG. 1. For example, the sequence Y can be formed with 
the appropriately inserted dummy values and the retrieved 
decision metrics that are soft-decision values in Q-bit 
representation. Every group of n values in the sequence Y 
is organized into a data symbol yt. It should be understood 

20 to those skilled in the art that 1-bit hard-decision values 
may also substitute for the exemplificative soft-decision 
values. The branch metric generator 120 is responsible for 
branch metric computation which provides a measurement. 
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Such a measurement, referred to as the branch metric, is an 
inverse of the likelihood that a given branch from a current 
state to a next state is correct. For a rate \/n Viterbi 
decoder, there are two possible state transitions and thus 
5 two branch metrics are calculated for each next state by 
measuring the difference between the received data symbol at 
a given instant and two associated branch labels. A branch 
label on a given branch specifies which value the encoder 
output. Although there are 64 encoder states for 802.11a/g, 

10 a total of four branch metrics are calculated for each data 
symbol because the rate 1/2 encoder can only output a 
symbol having a maximum of four possible values. 
Considering the dummy insertion, the branch metric generator 
12 0 ignores the appropriately inserted dummy values in 

15 response to the dummy insertion flag when calculating the 
branch metrics. In FIG. 1, the branch metrics generated at 
instant i by the branch metric generator 12 0 are referred to 
as BM^ . More particularly, a branch metric of a transition 
from state S' to state S at instant / is denoted by BM^^'^^ . 

20 According to the invention, the branch metric generator 120 
further pre-calculates a branch metric difference for the 
subsequent add- compare -select operation, which will be 
described in detail later. 

Still referring to FIG. 1, the ACS module 130 receives 

25 the output of the branch metric generator 120 and generates 
a path metric PM^ and a decision bit for each next 

state. According to the invention, the trellis diagram for 
the rate l/n encoder with a constraint length of K is 
organized in a butterfly structure. FIG. 2 shows a basic 

30 element of the butterfly trellis structure. Four possible 
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state transitions from instant / to instant i-\ are 
illustrated. The following relations shown in FIG. 2 are 
established for a rate l/n, constraint length K Viterbi 
decoder : 

P = 0, h 2 2^-2-1 

g = 2^-^ + p 
u = 2p 
v = 2p + l 

5 Here PM^ denotes a path metric of the survivor path 
entering state 5 at instant /. It is important to note that 
u is even and v is odd. This implies that an odd (even) 
state is reached only if the source input bit is '1' ('0'). 
In other words, state iS^ can be reached from either state Sp 
^ 10 or Sg when the source input bit is '0' . Similarly, state Sv 
can be reached from either state Sp or Sg when the source 
input bit is M'. In Viterbi decoding, the decision bit 
indicates which sum of a source path metric and a branch 
metric generated the smallest result and was selected as a 
15 new path metric. If the decision bit of a state at instant 
/ is '0' , the associated upper branch (the dashed line in 
FIG. 2) is the survivor path leading to a predecessor state. 
If it is M', the lower branch (the solid line in FIG. 2) 
will be chosen. 

20 One important property of the convolutional code specified 
in IEEE 802.11a/g is that BM^^"'^"^ = BM^^''^''^ and 
BM^^''^''^ ^BM^^^'^^K This property can be applied to jointly 
calculate the path metrics PAff" and PAff* . For calculating 
PMf" , instead of finding the minimum between PUfj^ -h BMj^"*^"^ 
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and PMf^^-\-BMf''^''^ , the values of = PMf_\ - PMfj^ and 

Zi=BM^^^'^"'> -BmI^''^"^ are compared. If P^<Z,. , it means 
PAffjy + BM^^^*^"^ < PMfj^ + BM^^'^"^ , and vice versa . Now for 
calculating PM^^ , the values of = PMfj^ - PMfj^ and 

5 - Z,. = BM^^^'^"^ - 5m}^^'^"^ = BmI^^'^"^ - wj^^**^"^ are compared , where 

and -Z,. are opposite numbers. As mentioned earlier, the 
branch metric difference is pre -calculated in the branch 

metric generator 12 0 by subtracting BM^^^*^"^ from BM^^"'^"^ . 
The other branch metric difference, - Z^- , can be obtained 

10 from the negative of Z,- . Note that both calculations of 
PM?" and PMf"" share the term , and thus one computation 
can be reduced. Also, the additions and subtractions are 
carried out with smaller bit -width. This results in lower 
complexity and power consumption. 

15 The ACS module 13 0 preferably comprises P number of ACS 

units each responsible for a basic butterfly element. The 
number of necessary ACS units is equal to half the number of 

total states, that is, P = 2^'*^ . FIG. 3 shows a detailed 
block diagram of the pth ACS unit in the ACS module 130. 

20 The /?th ACS unit 300 is made up of a subtracter 310, a A-bit 
multiplexer 320, a A-bit unsigned comparator 330, two 
combinational -logic circuits 340 and 350, and two adding 
means 360 and 370. To begin with, the branch metrics are 
represented by X bits of precision, in which X is given by: 

25 X^Q + n-\ 

The number of bits required to represent the branch metric 
difference is >9=A+1. On the other hand, the path metrics 
are represented by a bits of precision and a is given by an 
equation of the form: 
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a = l + flog2(/2./r(2^-l))] 

where [-] denotes a ceiling function which returns the 

smallest integer greater than or equal to the given numeric 

expression. For example, the decision metrics are quantized 

5 into 32 levels, i.e., they are represented by Q = 5 bits of 

precision. In this case, a is 10, /? is 7, and A is 6 for a 

rate 1/2, K=7 Viterbi decoder {n = 2) . It should be 

understood that a, J3, and A stand for the optimums according 

to the present disclosure; they may also be replaced with 

10 other values, as will be appreciated by those skilled in the 

art. The subtracter 310 calculates the path metric 

s s 

difference by subtracting PM^2\ from PM^^^ . The 6-bit 

multiplexer 32 0 selectively provides an output between 6 
least significant bits (LSBs) of Z,- and -Z,- according to a 
15 select signal SEL, The 6-bit unsigned comparator 33 0 is 
used to generate a comparison result, C, by comparing the 
magnitude of 6 LSBs of the 10-bit path metric difference, 
JJ^[5:0], and the magnitude of the 6-bit multiplexer output, 
Z;[5:0]. 

20 The combinational -logic circuits 340 and 350 are 

responsible for logically operating S most significant bits 
(MSBs) of the a-bit path metric difference, the comparison 
result C, and a sign bit of the branch metric difference at 
instant /, where ^=a-A. In the case of a rate 1/2, K^l 

25 Viterbi decoder, 4 MSBs of the 10-bit path metric 
difference, f^[9:6], are applied to the combinational -logic 
circuits 340 and 350 both. As shown in FIG. 3, the 
comparison result C and the sign bit of the 7-bit branch 
metric difference, Z,.[6] , are applied to the combinational - 
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logic circuits 340 and 350 as well. According to the 
invention, the combinational -logic circuit 340 is capable of 
predetermining whether the magnitude of the path metric 
difference is greater or less than that of the branch metric 
5 difference based on W^[9:6] and -^,[6] . If the 

predetermination is met, i.e., it gives a decisive answer, a 
decision bit, df" , can be set for state Su at instant i based 
on the predetermination; otherwise, df" is set to be 
consistent with the comparison result C. On the other 

10 hand, the combinational - logic circuit 350 is capable of 
predetermining whether the magnitude of the path metric 
difference is greater or less than that of the negative of 
the branch metric difference based on ff^[9:6] and Z,.[6] - If 
so, a decision bit, df'' , can be set for state Sv at instant / 

15 based on this predetermination. Provided that the 

predetermination is not met, i.e., it cannot give a decisive 
answer, ^ is set to be consistent with the comparison 
result C. 

Referring to FIG. 4A, a logic diagram of the 
20 combinational -logic circuit 340 according to an embodiment 
of the invention is illustrated. It is seen that the 
combinational -logic circuit 34 0 is implemented with AND 
gates 401a-g, NAND gate 403, OR gates 405a-d, NOR gates 
407a-b, and NOT gates 409a-b. The logic diagram of FIG. 4A 
25 can be summarized by the following Boolean expressions: 

^0 = ^ [9]rZJ6] + ((^.[9] ' Z,[6]) (^.[8] ■ ^.[7] ■ 

u, = [9] . Z, [6]-f((p ^.[9] • Z,[6]). (f^.[8] + W,[7] + ^^.[6])) 

df" = u,+(c-(uo-i-t/J) 

5£:L = P^.[9].Z,[6] + J^.[9].Z,[6] 
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As such, W^[9:6] and 2',[6] are used to predetermine the 
magnitude comparison between and Z,- . When the 

predetermination is met, i.e., either Wq or Mj is '1', the 
comparison result C from the 6-bit unsigned comparator 33 0 
5 is disabled by the AND gate 401g. As a result, df" follows 
the value of Mj . In this case, the decision bit can be set 
for state Su at instant / based on the predetermination. 
Conversely, both Wq and become '0' when and Z,- have 

the same sign and fJ^[8:6] cannot give additional information 

10 about the magnitude comparison. Therefore, the 

predetermination is not met and the comparison result C is 
enabled to set df" . Note that the select signal SEL is '0' 
when and both have the same sign. Accordingly, the 6- 
bit multiplexer 320 selects Z,.[5:0] for the 6-bit comparator 

15 33 0 so that the comparison result C is generated 
irrespective of the sign by comparing the magnitude of 
H^[5:0] and Z,.[5:0] . Similarly, a logic diagram of the 
combinational -logic circuit 350 according to an embodiment 
of the invention is illustrated in FIG. 4B. As depicted, 

20 the combinational -logic circuit 350 is implemented with AND 
gates 411a-g, NAND gate 413, OR gates 415a-d, NOR gate 417, 
arid NOT gates 419a-b. The Boolean expressions for the logic 
diagram of FIG. 4B are described as follows: 

Vq = ^[9] ■ Z,[6] + \Sw,[9\zMyim • ^[7] - 

V, = WX9\ Z, [6]+fc [9] . Z,[6fe.[8] + W,m + ff^-[6])) 

^^=Vi+(C'(vo+Vi)) . 

Also, Wi[9\6\ and Z,.[6] are used to predetermine the magnitude 
25 comparison between and -Z. . When the predetermination 

is met, i.e., either Vq or v, is M', the comparison result 
C is disabled by the AND gate 411g so that df" follows the 
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value of Wi . On the other hand, both Vq and Vj become '0' 
when and -Z, have the same sign and H^[8:6] cannot give 
additional information about the magnitude comparison. 
Therefore, the predetermination is not met and the 
5 comparison result C is enabled to set df"" . Note that the 
select signal SEL is when and -Z,. both have the same 

sign. Accordingly, the 6-bit multiplexer 320 selects 
-ZJ5:0] as the input to the 6-bit comparator 330 so that the 
comparison result C is generated irrespective of the sign 
10 by comparing the magnitude of W^[5:0] and -Z,[5:0]. It should 
be appreciated by those skilled in the art that either the 
combinational -logic circuit 340 or 350 is contemplated to 
provide the select signal SEL by the principles of the 
invention. 

15 In light of the foregoing description, it is shown that 

the combinational -logic circuits 34 0 and 3 50 can reduce the 
number of comparisons required during the ACS operation. If 
necessary, AND gates 401a-d, NAND gate 403, OR gate 405a, 
and NOT gate 409a-b are shared between the combinational - 

20 logic circuits 340 and 350 to reduce circuit complexity. 
With the 6-bit multiplexer 320, the 6-bit unsigned 
comparator 330 is capable of serving Z,.[5:0] and -Z,[5:0] one 
at a time. This is because and Z,. are the same sign 

while and -Z. have opposite signs, and vice versa. By 

25 sharing the 6-bit unsigned comparator 33 0 between even state 
Su and odd state Sy, further reduction in complexity and 
power dissipation can be accomplished. For IEEE 802,lla/g, 
this sharing can save a Viterbi decoder up to 32 unsigned 
comparators . 
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Turning back to FIG. 3, the adding means 360 can 
calculate PMf" by selectively adding PMfj^ and Bm\^''^"'^ or 
adding /W,^ and BM> according to d^" . Likewise, the 
adding means 370 can calculate PA/,?*' by selectively adding 
5 PMfjy and Bm\^''^''^ or adding PAdfj!^ and BM^^''^""^ according to 
rf,. " . In addition, use of the best -state decoding strategy 
locates the state having the smallest path metric. 
Typically, this can be done by comparing the path metrics of 
all states with subtractions. According to the invention, 

10 the unique architecture of the /7th ACS unit 300 lends itself 
to pre-compare PM^" and PAf ^- " without subtractions, whereby 
a saving of half the output number of path metrics at 
instant i can be achieived. This pre -comparison also enables 
the subsequent survivor memory unit 14 0 to gain significant 

15 reduction in latency, circuit complexity, and power 
consumption- In this regard, the pth ACS unit 3 00 

preferably comprises means for predetermining a local winner 
state between states Su and Sv at instant / based on df" , df" , 
and the sign of or . The method for predetermining a 

20 local winner state is described as follows: 

Case \: df" = 0 and df^ = 1 

If >0, the local winner state is Sv\ otherwise, the local winner state is Su. 
Case 2: df" = 1 and df^ = 0 

If >0, the local winner state is Sul otherwise, the local winner state is 
Case 3: df" = 0 and df^ = 0 

If > 0, the local winner state is Su\ otherwise, the local winner state is Sy, 
Case 4: df" = 1 and df^ = 1 

If Zj- > 0, the local winner state is Sy\ otherwise, the local winner state is Su* 
As a result, every iteration the ACS module 130 sends 
current decision bits of 64 ( 2^"^ ) states and current path 
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metrics of 32 ( 2^"^ ) local winner states to survivor memory 
unit 140. 

The survivor memory unit 140 is a type of survivor path 
decoder. Referring to FIG. 5, the survivor memory unit 140 
5 is constituted by a best survivor unit 510, a register- 
exchange network 520, and a trace-back unit 530. The best 
survivor unit 510 simultaneously receives the path metrics 

of 32 ( 2^"^ ) local winner states at consecutive time 
instants from the ACS module 13 0. As stated above, each 

10 local winner state is chosen from a pair of odd and even 
states. Due to the elaborate design of the invention, the 
best survivor unit 510 needs to find a best state with the 
smallest path metric only once every L iterations, where the 
number of L is equal to a divisible factor of a data payload 

15 length for a conformant 802.11a/g system. In one 

embodiment, L is equal to 8. The best survivor unit 510 
includes / 2-to-l comparators for choosing the best state 
among the 32 local winner states by comparing 32 path 
metrics of the local winner states in 7 {L-l) iterations, 

20 leaving the 8th iteration for the best state latching into 
the trace-back unit 530. The number of 2-to-l comparators, 
y, is given by: 
'2^-2-1" 

y = 

L-l 

Therefore, ;^ is 5 for the conformant 802.11a/g system. In 7 
25 ACS iterations, each of the 2-to-l comparators is used to 
compare two out of 32 path metrics to find the minimum path 
metric among the 32 local winner states. Hence, the state 
associated with the minimum path metric is selected as the 
best state, S*_o . 
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As depicted, the register- exchange network 520 receives 

the decision bits of 64 ( 2^"^ ) states from the ACS module 
13 0. In the register -exchange network 52 0, associated with 
every state is a register which contains the survivor path 
5 leading to that state. Each survivor path is uniquely 
specified by and stored as a sequence of decision bits along 
the survivor path. According to the decision bits from 
instant i-L (Z = 8) to instant i, decision vectors of 
survivor paths leading to the 64 states at instant / can be 

10 generated, respectively. For example, a decision vector of 
the survivor path to state S from instant i-L to / is given 
by the recursive update 

dl^, = (d^y^,j^ « 1 + df)[L -1:0] 
where is the predecessor state of S as determined by its 

15 decision bit df from the ACS operation. The current state 
decision df is used to select the predecessor state 
decision vector which is left shifted to allow df to be 
appended to the vector. This update occurs concurrently for 
all states, hence the name register-exchange, since each 

20 update corresponds to an exchange of the register contents 
modulo the shift and append. If L=S; the above expression 
is rewritten as: 

rff-8,i=(<9./-i «l + ^f)[7:0] 
where the decision vector df_^j has a length of 8 (L ) bits. 

25 The trace-back unit 530 utilizes a backward processing 

algorithm for survivor path update and decode. Such an 
algorithm requires the decision bits to be stored in a 
memory prior to tracing back the survivor path. In general, 
the trace-back recursion estimates the previous state Si-i 

30 given the current state Si as the following update 
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5,-1 = ({d^ « (K-l)-^ 5, ) » 1)[^ - 2 : 0] 
which corresponds to 1-bit right shift of the current state 
register with input equal to the current state decision df . 
According to the invention, the trace-back recursion from 
5 instant i to i — L can be collapsed in a single trace-back 
recursion of the form 

Si-i = « (K-l) + S,) » L)[K -2:0] 

If Z = 8 and K=7; the above expression can be rewritten as: 
5,_8 = « 6 + 5, ) » 8)[5 : 0] = df_,, [7 : 2] 

10 which means that the predecessor state at instant /-8 is 
determined by 6 MSBs of the survivor path sequence from the 
state 5* at instant /, rf,?8,-[7:2] . As described earlier, the 
register-exchange network 520 of length L is capable of 
calculating the survivor path sequence from instant i-L to i 

15 for each state (i.e., the decision vector df__^j ) prior to 
trace-back. Using this pretrace-back technique, the trace- 
back recursion iterates from instant / to i — L in a single 
trace-back operation. Accordingly, every 8 {L ) iterations 
the best state and the decision vectors of all states are 

20 fed to the t race-back unit 530. Then the trace-back unit 
53 0 stores the decision vectors of the 64 states and finds a 
global survivor path sequence by following the decision 
vectors back from the best state at instant /-8, 5*_8 - In 
this manner, 8 decoded bits are output from the trace -back 

25 unit 530 every 8 iterations. More particularly, the novel 
design of the survivor memory unit 14 0 reduces the required 
chip area and power consumption compared to conventional 
techniques. 

While the invention has been described by way of 
30 example, and in terms of the preferred embodiments, it is to 
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be understood that the invention is not limited to the 
disclosed embodiments. To the contrary, it is intended to 
cover various modifications and similar arrangements (as 
would be apparent to those skilled in the art) . Therefore, 
5 the scope of the appended claims should be accorded the 
broadest interpretation so as to encompass all such 
modifications and similar arrangements. 
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